A Morphological Processor for Malayalam Language

نویسندگان

  • Sumam Mary Idicula
  • Peter S. David
چکیده

Work on morphological analyzers (which are computer programmes) for Indian languages is conducted vigorously these days. Usually published in specialized journals, this rather technical work is briefly presented here to provide some insights to a wider readership into little-known aspects of current language work. The morphological strength of Malayalam as a major South Indian language justifies the use of thorough morphological processing, which is the first step in any natural language processing task. The project is aimed at building a morphological processor for language, with two main components: a morphological generator and a morphological analyzer. The computational model of the processor takes care of the processing of nouns, pronouns, verbs and modifiers. The results obtained are encouraging and the work can be extended to the creation of a full-fledged part-of-speech tagger for Malayalam and other Dravidian languages, since they all exhibit structural homogeneity.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Unity in Diversity: A Unified Parsing Strategy for Major Indian Languages

This paper presents our work to apply non linear neural network for parsing five r esource p oor I ndian L anguages belonging to two major language families Indo-Aryan and Dravidian. Bengali and Marathi are Indo-Aryan languages whereas Kannada, Telugu and Malayalam belong to the Dravidian family. While little work has been done previously on Bengali and Telugu linear transition-based parsing, w...

متن کامل

Automated Plagiarism Detection System for Malayalam Text Documents

In this paper, a plagiarism detection tool for plagiarism detection in Malayalam documents is presented. Many language-sensitive tools for detecting plagiarism in natural language documents have been developed, particularly for English. Detecting plagiarism in Malayalam documents is particularly a challenging task because of the complex linguistic structure of Malayalam. The plagiarism detectio...

متن کامل

Automated Plagiarism Detection System for Malayalam Text Documents

In this paper, a plagiarism detection tool for plagiarism detection in Malayalam documents is presented. Many language-sensitive tools for detecting plagiarism in natural language documents have been developed, particularly for English. Detecting plagiarism in Malayalam documents is particularly a challenging task because of the complex linguistic structure of Malayalam. The plagiarism detectio...

متن کامل

Automated Plagiarism Detection System for Malayalam Text Documents

In this paper, a plagiarism detection tool for plagiarism detection in Malayalam documents is presented. Many language-sensitive tools for detecting plagiarism in natural language documents have been developed, particularly for English. Detecting plagiarism in Malayalam documents is particularly a challenging task because of the complex linguistic structure of Malayalam. The plagiarism detectio...

متن کامل

Clause Boundary Identification for Malayalam Using CRF

This paper presents a clause boundary identification system for Malayalam sentences using the machine learning approach CRF (Conditional Random Field).Malayalam Language is considered as a 'Left branching language' where verbs are seen at the end of the sentence. Clause boundary identification plays a vital role in many NLP applications and for Malayalam language, the clause boundary identifica...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2007